skip to main content


Search for: All records

Creators/Authors contains: "Nguyen, Hung"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract

    Single-cell RNA sequencing (scRNA-Seq) is a recent technology that allows for the measurement of the expression of all genes in each individual cell contained in a sample. Information at the single-cell level has been shown to be extremely useful in many areas. However, performing single-cell experiments is expensive. Although cellular deconvolution cannot provide the same comprehensive information as single-cell experiments, it can extract cell-type information from bulk RNA data, and therefore it allows researchers to conduct studies at cell-type resolution from existing bulk datasets. For these reasons, a great effort has been made to develop such methods for cellular deconvolution. The large number of methods available, the requirement of coding skills, inadequate documentation, and lack of performance assessment all make it extremely difficult for life scientists to choose a suitable method for their experiment. This paper aims to fill this gap by providing a comprehensive review of 53 deconvolution methods regarding their methodology, applications, performance, and outstanding challenges. More importantly, the article presents a benchmarking of all these 53 methods using 283 cell types from 30 tissues of 63 individuals. We also provide an R package named DeconBenchmark that allows readers to execute and benchmark the reviewed methods (https://github.com/tinnlab/DeconBenchmark).

     
    more » « less
  2. Low-complexity nucleotide repeat sequences, which are implicated in several neurological disorders, undergo liquid–liquid phase separation (LLPS) provided the number of repeat units, n , exceeds a critical value. Here, we establish a link between the folding landscapes of the monomers of trinucleotide repeats and their propensity to self-associate. Simulations using a coarse-grained Self-Organized Polymer (SOP) model for (CAG) n repeats in monovalent salt solutions reproduce experimentally measured melting temperatures, which are available only for small n . By extending the simulations to large n , we show that the free-energy gap, ΔG S , between the ground state (GS) and slipped hairpin (SH) states is a predictor of aggregation propensity. The GS for even n is a perfect hairpin (PH), whereas it is a SH when n is odd. The value of ΔG S (zero for odd n ) is larger for even n than for odd n . As a result, the rate of dimer formation is slower in (CAG) 30 relative to (CAG) 31 , thus linking ΔG S to RNA–RNA association. The yield of the dimer decreases dramatically, compared to the wild type, in mutant sequences in which the population of the SH decreases substantially. Association between RNA chains is preceded by a transition to the SH even if the GS is a PH. The finding that the excitation spectrum—which depends on the exact sequence, n , and ionic conditions—is a predictor of self-association should also hold for other RNAs (mRNA for example) that undergo LLPS. 
    more » « less
    Free, publicly-accessible full text available June 13, 2024
  3. Flow-based manipulation of particles is an essential tool for studying soft materials, but prior work has nearly exclusively relied on using two-dimensional (2D) flows generated in planar microfluidic geometries. In this work, we demonstrate 3D trapping and manipulation of freely suspended particles, droplets, and giant unilamellar vesicles in 3D flow fields using automated flow control. Three-dimensional flow fields including uniaxial extension and biaxial extension are generated in 3D-printed fluidic devices combined with active feedback control for particle manipulation in 3D. Flow fields are characterized using particle tracking velocimetry complemented by finite-element simulations for all flow geometries. Single colloidal particles (3.4 μm diameter) are confined in low viscosity solvent (1.0 mPa s) near the stagnation points of uniaxial and biaxial extensional flow for long times (≥10 min) using active feedback control. Trap stiffness is experimentally determined by analyzing the power spectral density of particle position fluctuations. We further demonstrate precise manipulation of colloidal particles along user-defined trajectories in three dimensions using automated flow control. Newtonian liquid droplets and GUVs are trapped and deformed in precisely controlled uniaxial and biaxial extensional flows, which is a new demonstration for 3D flow fields. Overall, this work extends flow-based manipulation of particles and droplets to three dimensions, thereby enabling quantitative analysis of colloids and soft materials in complex nonequilibrium flows. 
    more » « less
    Free, publicly-accessible full text available July 1, 2024
  4. Abstract Recent advances in biochemistry and single-cell RNA sequencing (scRNA-seq) have allowed us to monitor the biological systems at the single-cell resolution. However, the low capture of mRNA material within individual cells often leads to inaccurate quantification of genetic material. Consequently, a significant amount of expression values are reported as missing, which are often referred to as dropouts. To overcome this challenge, we develop a novel imputation method, named single-cell Imputation via Subspace Regression (scISR), that can reliably recover the dropout values of scRNA-seq data. The scISR method first uses a hypothesis-testing technique to identify zero-valued entries that are most likely affected by dropout events and then estimates the dropout values using a subspace regression model. Our comprehensive evaluation using 25 publicly available scRNA-seq datasets and various simulation scenarios against five state-of-the-art methods demonstrates that scISR is better than other imputation methods in recovering scRNA-seq expression profiles via imputation. scISR consistently improves the quality of cluster analysis regardless of dropout rates, normalization techniques, and quantification schemes. The source code of scISR can be found on GitHub at https://github.com/duct317/scISR . 
    more » « less
  5. Advances in single-cell RNA sequencing (scRNAseq) technologies have allowed us to study the heterogeneity of cell populations. The cell compositions of tissues from different hosts may vary greatly, indicating the condition of the hosts, from which the samples are collected. However, the high sequencing cost and the lack of fresh tissues make single-cell approaches less appealing. In many cases, it is practically impossible to generate single-cell data in a large number of subjects, making it challenging to monitor changes in cell type compositions in various diseases. Here we introduce a novel approach, named Deconvolution using Weighted Elastic Net (DWEN), that allows researchers to accurately estimate the cell type compositions from bulk data samples without the need of generating single-cell data. It also allows for the re-analysis of bulk data collected from rare conditions to extract more in-depth cell-type level insights. The approach consists of two modules. The first module constructs the cell type signature matrix from single-cell data while the second module estimates the cell type compositions of input bulk samples. In an extensive analysis using 20 datasets generated from scRNA-seq data of different human tissues, we demonstrate that DWEN outperforms current state-of-the-arts in estimating cell type compositions of bulk samples. 
    more » « less
  6. Abstract

    Unsupervised clustering of single-cell RNA sequencing data (scRNA-seq) is important because it allows us to identify putative cell types. However, the large number of cells (up to millions), the high-dimensionality of the data (tens of thousands of genes), and the high dropout rates all present substantial challenges in single-cell analysis. Here we introduce a new method, named single-cell Clustering using Autoencoder and Network fusion (scCAN), that can overcome these challenges to accurately segregate different cell types in large and sparse scRNA-seq data. In an extensive analysis using 28 real scRNA-seq datasets (more than three million cells) and 243 simulated datasets, we validate that scCAN: (1) correctly estimates the number of true cell types, (2) accurately segregates cells of different types, (3) is robust against dropouts, and (4) is fast and memory efficient. We also compare scCAN with CIDR, SEURAT3, Monocle3, SHARP, and SCANPY. scCAN outperforms these state-of-the-art methods in terms of both accuracy and scalability. The scCAN package is available athttps://cran.r-project.org/package=scCAN. Data and R scripts are available athttp://sccan.tinnguyen-lab.com/

     
    more » « less